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The dynamics of a wide range of real systems, from email patterns to earthquakes, display a 
bursty, intermittent nature, characterized by short timeframes of intensive activity followed by long 
times of no or reduced activity. The understanding of the origin of such bursty patterns is hindered 
by the lack of tools to compare different systems using a common framework. We introduce two 
measures to distinguish the mechanisms responsible for the bursty nature of real signals, changes in 
the interevent times and memory. We find that while the burstiness of natural phenomena is rooted 
in both the interevent time distribution and memory, for human dynamics memory is weak, and the 
bursty character is due to changes in the interevent time distribution. Finally, we show that current 
models lack in their ability to reproduce the activity pattern observed in real systems, opening up 
new avenues for future work. 
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The dynamics of most complex systems is driven by 
the loosely coordinated activity of a large number of com- 
ponents, such as individuals in the society or molecules 
in the cell. While we witnessed much progress in the 
study of the networks behind these systems [lj , advances 
towards understanding the system's dynamics has been 
slower. With increasing potential to monitor the time- 
resolved activity of most components of selected complex 
systems, such as time-resolved email [2- C3- OJ - web brows- 
ing 5], and gene expression Q patterns, we have the op- 
portunity to ask an important question: is the dynamics 
of complex systems governed by generic organizing prin- 
ciples, or each system has its own distinct dynamical fea- 
tures? While it is difficult to offer a definite answer to 
this question, a common feature across many systems is 
increasingly documented: the burstiness of the system's 
activity patterns. 

Bursts, vaguely corresponding to significantly en- 
hanced activity levels over short periods of time followed 
by long periods of inactivity, have been observed in a 
wide range of systems, from email patterns Q to earth- 
quakes and gene expression jg. Yet, often bursti- 
ness is more of a metaphor than a quantitative feature, 
and opinions about its origin diverge, fn human dynam- 
ics, burstiness has been reduced to the fat-tailed nature 
of the response time distribution 0i 0| , in contrast with 
earthquakes and weather patterns, where memory effects 
appear to play a key role Once present, burstiness 
can affect the spreading of viruses or resource alloca- 
tion Also, deviations towards regular, "anti-bursty" 
behavior in heartbeat may indicate disease progression 
|l2|. Given the diversity of systems in which it emerges, 
there is a need to place burstiness on a strong quanti- 
tative basis. Our goal in this paper is to make a first 
step in this direction, by developing measures that can 
help quantify the magnitude and potential origin of the 
bursty patterns seen in different real systems. 



Let us consider a system whose components have a 
measurable activity pattern that can be mapped into 
a discrete signal, recording the moments when some 
events take place, like an email being sent, or a pro- 
tein being translated. The activity pattern is random 
(Poisson process) if the probability of an event is time- 
independent. In this case the interevent time between 
two consecutive events (r) follows an exponential distri- 
bution, -Pp(r) ~ exp(— t/tq) (Fig. fa). An apparently 
bursty (or anti-bursty) signal could emerge if P(t) is dif- 
ferent from the exponential, such as the bursty pattern 
of Fig. lb, or the more regular pattern of Fig. lc. Yet, 
changes in the interevent time distribution is not the only 
way to generate a bursty signal. For example, the sig- 
nals shown in Fig. ld,e have exactly the same P(t) as in 
Fig. la, yet they have a more bursty or a more regular 
character. This is achieved by introducing memory: in 
Fig. Id the short interevent times tend to follow short 
ones, resulting in a bursty look. In Fig. le the relative 
regularity is due to a memory effect acting in the oppo- 
site direction: short (long) interevent times tend to be 
followed by long (short) ones. Therefore, the apparent 
burstiness of a signal can be rooted in two, mechanisti- 
cally quite different deviations from a Poisson process: 
changes in the interevent time distribution or memory. 
To distinguish these effects, we introduce the burstiness 
parameter A and the memory parameter fj,, that quantify 
the relative contribution of each in real systems. 

The burstiness parameter A is defined as 

A ^^-^ f\P { r)-Mr)\dr, (1) 

where m T and ov are the mean and the standard devi- 
ation of P(t) 13] . The meaning of A is illustrated in 
Fig. lf-h, where we compare P(r) for a bursty- (Fig. If) 
and an anti-bursty (Fig. lg) signal with a Poisson in- 
terevent time distribution. A signal will appear bursty 
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FIG. 1: (a) A signal generated by a Poisson process with a 
unit rate. (b,c) Bursty character through the interevent time 
distribution: A bursty signal (b) generated by the power- 
law interevent time distribution P(t) ~ r _1 , and an anti- 
bursty signal (c) generated by the Gaussian interevent time 
distribution with m = 1 and a = 0.1. A bursty signal can 
emerge through memory as well. For example, the bursty sig- 
nal shown in (d) is obtained by shuffling the Poisson signal 
of (a) to increase the memory effect. A more regular look- 
ing signal, with negative memory, is obtained by the same 
shuffling procedure (e). Note that signals in (a), (d) and (e) 
have identical interevent time distribution, (f) An interevent 
time distribution (solid line) will appear bursty (A > 0) if it 
has a higher frequency of long or short interevent times than 
predicted for a Poisson process (dotted line). Inset shows 
the same curves in log-log scale, (g) The signal will appear 
to be regular (anti-bursty, A < 0) if P(r) is higher in the 
average interevent time region than a Poisson process. The 
burstiness parameter A is half of the shaded area between the 
corresponding interevent time distribution (solid) and the ref- 
erence distribution (dotted), (h) The stretched exponential 
interevent distribution interpolates between a highly bursty 
(A = 1), a Poisson (A = 0), and a regular (A = — 1) signal. 
The figure shows A as a function of the parameter u. 



if the frequency of the short and long interevent times 
is higher than in a random signal (Fig. If), resulting 
in many short intervals separated by longer periods of 
inactivity. Therefore, there are fewer interevent times 
of average length than in a Poisson process. A sig- 
nal displays anti-bursty character, however, if the fre- 



quency of the interevent times is enhanced near the av- 
erage and depleted in the short and long interevent time 
region (Fig. lg). A is bounded in the range (—1,1), 
and its magnitude correlates with the signal's burstiness: 
A = 1 is the most bursty signal, A — is completely 
neutral (Poisson), and A = — 1 corresponds to a com- 
pletely regular (periodic) signal. For example, in Fig. lh 
we show A for the stretched exponential distribution, 
Pse(t) = ^(t/to) 11 " 1 exp[-(r/T ) tl ]/T , often used to 
approximate the interevent time distributions of complex 
systems ^4|. The smaller the u is, the burstier is the sig- 
nal, and for u — > 0, Pse(t) follows a power law with the 
exponent —1, for which A = 1. For u = 1, Pse(t) is 
simply the exponential distribution with A = 0. Finally, 
for u > 1, the larger u is, the more regular is the signal, 
and for u — > oo, P(t) converges to a Dirac delta function 
with A = -1. 

Most complex systems display a remarkable hetero- 
geneity: some components may be very active, and others 
much less so. For example, some users may send dozens 
of emails during a day, while others only one or two. To 
combine the activity levels of so different components, 
we can group the signals based on their average activity 
level, and measure P(t) only for components with similar 
activity level. As the insets in Fig. 2 show, the obtained 
curves are systematically shifted. If we plot, however, 
tqP(t) as a function of t/tq, where To being the average 
interevent time, the data collapse into a single curve T(x) 
(Fig. 2), indicating that the interevent time distribution 
follows P{t) = (1/t )J-(t/t ), where T{x) is indepen- 
dent of the average activity level of the component, and 
represents an universal characteristic of the particular 
system 0, Il5| . This raises an important question: will A 
depend on To? The burstiness parameter A is invariant 
under the time rescaling as f = t/tq and P(f ) = tqP(t) 
with a constant To, since A = f Q |P(t) — Po(f)\df — 

f °° |t P(t) - T Po(tMt/t ) = f °° |P(T) - P (r)\dr ee 

A, i.e., it characterizes the universal function F(x). Such 
invariance of A enables us to assign to each system a sin- 
gle burstiness parameter, despite the different activity 
level of its components. 

The memory coefficient \x of a signal is defined as the 
correlation coefficient of all consecutive interevent time 
values in the signal over a population. That is, given all 
pairs of consecutive interevent times (t^, t/^+i) for all 
components {k = 1, ■ • ■ , N}, 



N n k -l 
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(n - m kl )(T l+1 - m k2 ) 
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where N is the number of components in the system, 
rik is the number of events recorded for component k, 
and mfei(mfc2) and Oki{o~k2) are the mean and standard 
deviation of t^^'s (t^.^i's), respectively. The memory 
coefficient is positive when a short (long) interevent time 
tends to be followed by a short (long) one, and it is neg- 
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FIG. 2: Interevent time distributions P(r) for some real signals, (a) P(t) for e-mail activity of individuals from a University 
0- T corresponds to the time interval between two emails sent by the same user, (b) Interevent time distribution for the 
occurrence of letter in the text of C. Dickens' David Copperfield [lq. (c) Interevent time distribution of cardiac rhythm of 
individuals 21]. Each event corresponds to the beat in the heartbeat signal. In each panel, we also show for reference the 
exponential interevent time distribution (dotted). Unsealed interevent time distribution is shown in the inset for each dataset. 



ative when a short (long) interevent time is likely to be 
followed by a long (short) one. The measurements indi- 
cate that \x is independent of tq. 

Mapping complex systems on the (/i, A) space — Given 
that the burstiness of a signal can have two qualitatively 
different origins, the best way to characterize a real sys- 
tem is to identify its /i and A parameters, placing them 
in a (^t, A) phase diagram (Fig. 3). As a first example, 
we measured the spacing between the consecutive occur- 
rence of the same letter in texts of different kind, era, and 
language 01 . For these signals, we find A w 0, i.e., the 
interevent time distribution follows closely an exponen- 
tial (Fig. 2b) and \x ~ 0.01, indicating the lack of memory. 
Thus this signal is best described by a Poisson process, at 
the origin of the phase diagram (Fi g. 3 ). In contrast, nat- 
ural phenomena, like earthquakes [17j and weather pat- 
terns 18j are in the vicinity of the diagonal, indicating 
that P(t) and memory equally contribute to their bursty 
character. The situation is quite different, however, for 
human activity, ranging from email and phone commu- 
nication to web browsing and library visitation patterns 
[S 0, IE H3 • For these we find a high A and small or 
negligible /i, indicating that while these systems display 
significant burstiness rooted in P(t), memory plays a 
small role in their temporal inhomogeneity. This lack of 
memory is quite unexpected, as it suggests the lack of 
predictability in these systems in contrast with natural 
phenomena, where strong memory effects could lead to 
predictive tools. Finally for cardiac rhythms describing 
the time between two consecutive heartbeats (Fig. 2c) 
[13, we find A car diac, healthy = -0.73(4) for healthy in- 
dividuals and A car( jiac, chf = —0.82(6) for patients with 
congestive heart failure (CHF), both signals being highly 



regular. Thus the A parameter captures the fact that 
cardiac rhythm is more regular with CHF than in the 
healthy condition [l^. Furthermore, we find /i ss 0.97, 
indicating that memory also plays a very important role 
in the signal's regularity. 

The discriminative nature of the (/i, A) phase diagram 
is illustrated by the non-random placement of the differ- 
ent systems in the plane: human activity patterns cluster 
together in the high A, low /i region, natural phenomena 
near the diagonal, heartbeats in the high fi, negative A 
region and texts near A = (j, = 0, underlying the exis- 
tence of distinct classes of dynamical mechanisms driving 
the temporal activity in these systems. 

Following the clustering of the empirical measurements 
in the phase diagram, a natural question emerges: to 
what degree can current models reproduce the observed 
quantitative features of bursty processes? Queueing 
models, proposed to capture human activity patterns, are 
designed to capture the wai ting times of the tasks, rather 
than interevent times 0, 0,IU- Therefore, placing them 
on the phase diagram is not meaningful. A bursty signal 
can be generated by 2-state model j23(, switching with 
probability p its internal state between Poisson processes 
with two different rates Ao < Ai. A for the model is inde- 
pendent of p in the long time limit as long as p > 0, and 
takes its value in the range < A < 0.5, approaching 
when Ao « Ai and 0.5 as Ai — » oo and Ao — » 0. The 
memory coefficient of the model follows /i = ^4(0. 5 — p), 
where A is a positive constant dependent on Ao and Ai 
so that —1/3 < n < 1/3. The region in the (n,A) space 
occupied by the model is shown in the light grey area 
in Fig. 3a, suggesting that by changing its parameters 
the model could account for all observed behaviors. Yet, 
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FIG. 3: (Color) (a) The (/i, A) phase diagram. Human behaviors (red) are captured by activity patterns pertaining to email (□) 
0, library loans (o) and printing (o) [T^ of individuals in Universities, call center record at an anonymous bank (A) [let , 
and phone initiation record from a mobile phone company (V). Data for natural phenomena (black) are earthquake records in 
Japan (o) |T^| and daily precipitation record in New Mexico, USA (□) [Isf. Data for human texts (blue) [lr| are the English 
text of David Copperfield (o) and the Hungarian text of Isten Rabjai by Gardonyi Geza (n). Data for physiological behaviors 
(green) are the normal sinus rh yth m (o) and the cardiac rhythm with CHF (□) of human subjects [2l|. Grey area is the region 
occupied by the 2-state model |23j|. (b) Close-up of the most populated region. 



the agreement is misleading: for example, for human ac- 
tivities P(t) is fat-tailed, which is not the case for the 
model. This indicates that A and /i offer only a first or- 
der approximation for the origin of the burstiness, and 
for a detailed comparison between models and real sys- 
tems we need to inspect other measures as well, such as 
the functional form of P(t). It also indicates the lack 
of proper modeling tools to capture the detailed mech- 
anisms responsible for the bursty interevent time distri- 
butions seen in real systems, opening up possibilities for 
future work. 
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